An Index-Based Method for Timestamped Event Sequence Matching

Authors

  • Sanghyun Park
  • Jung-Im Won
  • Jeehee Yoon
  • Sang-Wook Kim
Abstract

This paper addresses the problem of timestamped event sequence matching, a new type of sequence matching that retrieves the occurrences of interesting patterns from a timestamped event sequence. Timestamped event sequence matching is useful for discovering temporal causal relationships among timestamped events. In this paper, we first point out the shortcomings of prior approaches to this problem and then propose a novel method that employs an R∗-tree to overcome them. To build the R∗-tree, the method places a time window at every position of a timestamped event sequence and represents each window as an n-dimensional rectangle by considering the first and last occurrence times of each event type. Here, n is the total number of distinct event types that may occur in a target application. When n is large, we apply a grouping technique to reduce the dimensionality of the R∗-tree. To retrieve the occurrences of a query pattern from a timestamped event sequence, the proposed method first identifies a small number of candidates by searching the R∗-tree and then picks out the true answers from them. We prove its robustness formally and show its effectiveness via extensive experiments.
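The window-to-rectangle step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `window_rectangles`, the half-open window `[start, start + window_size)`, the use of offsets relative to the window start, and the sentinel interval `(window_size, window_size)` for event types that do not occur in a window are all assumptions made for illustration. The sketch only builds the rectangles; inserting them into an R∗-tree and the grouping technique for large n are left out.

```python
from bisect import bisect_left


def window_rectangles(events, event_types, window_size):
    """Build one n-dimensional rectangle per window position.

    events      -- list of (event_type, timestamp), sorted by timestamp
    event_types -- ordered list of the n event types (one dimension each)
    window_size -- length of the time window (hypothetical parameter)

    Returns a list of rectangles; each rectangle is a list of
    (first_offset, last_offset) pairs, one pair per event type.
    """
    times = [t for _, t in events]
    rects = []
    for i, (_, start) in enumerate(events):
        # Index one past the last event inside [start, start + window_size).
        end = bisect_left(times, start + window_size)
        # Assumption: a type absent from the window gets the sentinel
        # interval (window_size, window_size).
        first = {e: window_size for e in event_types}
        last = {e: window_size for e in event_types}
        seen = set()
        for etype, t in events[i:end]:
            offset = t - start
            if etype not in seen:
                first[etype] = offset  # first occurrence in this window
                seen.add(etype)
            last[etype] = offset       # overwritten until the last occurrence
        rects.append([(first[e], last[e]) for e in event_types])
    return rects


events = [("A", 0), ("B", 2), ("A", 5), ("C", 9)]
rects = window_rectangles(events, ["A", "B", "C"], window_size=6)
print(rects[0])  # [(0, 5), (2, 2), (6, 6)]
```

In the first window, type A occurs at offsets 0 and 5, type B only at offset 2, and type C not at all, so the rectangle spans [0, 5] in the A dimension and degenerates to a point in the other two. A query pattern could then be checked against these intervals to discard windows that cannot contain it before verifying the remaining candidates.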

Similar resources

A multi-dimensional indexing approach for timestamped event sequence matching

This paper addresses the problem of timestamped event sequence matching, a new type of similar sequence matching that retrieves the occurrences of interesting patterns from timestamped sequence databases. The sequential-scan-based method, the trie-based method, and the method based on the iso-depth index are well-known approaches to this problem. In this paper, we point out their shortcomings, ...

Querying Timestamped Event Sequences by Exact Search or Similarity-based Search: Design and Empirical Evaluation

Specifying timestamped event sequence queries is challenging even for skilled computer professionals familiar with SQL. Most graphical user interfaces for database search use an exact search approach, which is often effective but applies an exact-match criterion. We describe a new similarity-based search interface, in which users specify a query by simply placing events on a blank timeline and r...

Measurement of Left Ventricular Myocardium Wall Instantaneous Motions with Echocardiographic Sequence Images

Background & Aims: One of the important aims of quantitative cardiac image processing is the clarification of myocardial motions in order to derive biomechanical behavior of the heart in the disease condition. In this study we presented a computerized analysis method for detecting the instantaneous myocardial changes by using 2D echocardiography images. Methods: The analysis was performed on th...

An edit operation-based approach to approximate string matching in large DNA databases

In DNA-related research, mutations occur very often due to various environmental conditions, where a mutation is defined as a heritable change in the DNA sequence. Therefore, approximate string matching is applied to answer queries that find mutations. The problem of approximate string matching is that given a user-specified parameter, k, we want to find where the substrings, which could ...

How to improve efficiency of analysis of sequential data?

Many of today's database applications, including market basket analysis, web log analysis, and DNA and protein sequence analysis, use databases to store and retrieve sequential data. Commercial database management systems allow sequential data to be stored, but they do not support efficient querying of such data. To increase the efficiency of analysis of sequential data, new index structures need to b...

Publication date: 2005